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Abstract 

Kingman’s model of selection and mutation studies the limit type value distribution 
in an asexual population of discrete generations and infinite size undergoing selection 
and mutation. This paper generalizes the model to analyse the long-term evolution of 
Escherichia, coli in Lenski experiment. Weak assumptions for fitness functions are pro¬ 
posed and the mutation mechanism is the same as in Kingman’s model. General macro¬ 
scopic epistasis are designable through fitness functions. Convergence to the unique 
limit type distribution is obtained. 


Keywords: Selection-Mutation balance, House of Cards, Type distribution, Fitness func¬ 
tion, Experimental evolution, Lenski experiment, Escherichia, coli. Condensation. 

MSC Subject classification: 92D10 (primary), 37N25, 60F05, 92D15 (secondary) 


1 Introduction 

Evolutionary forces in a population vary from macroscopic scale to microscopic scalebinclud- 
ing random environment, migration, natural selection, macroscopic epistasis (or individual 
interaction), microscopic epistasis, and linkage and dominance, clonal interference, muta¬ 
tion, genetic drift, recombination, and so on. Recently, mathematicians are interested in 
incorporating as many factors as possible in an evolutionary model, either deterministic or 
stochastic, to understand the contribution of each factor and to see which state the model 
can reach in the limit (see for example [15, 4, 5] among numerous works). However one 
would expect a high level of complexity of modelling and analysis when many factors enter 
into play. 

Kingman [10] suggested that one can regard an equilibrium of the evolutionary model as 
existing because of two preponderant factors, other phenomena causing perturbations of the 
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equilibrium. The pair of factors in his model are selection and mutation. This particular 
case had also been the subject of study of Moran ([12, 13, 14]) almost at the same time. 

More specifically, Kingman [10] proposed a one-locus, discrete generation model under 
selection and mutation with an infinite number of possible alleles which have continuous 
effects on a quantitative type. The continuum-of-alleles models were introduced by Crow 
and Kimura [6] and Kimura [9] and are used frequently in quantitative genetics, since types 
usually have a polygenetic basis. 

Kingman’s idea can be applied to model the Lenski experiment which investigates the 
long term evolution of E.coli in the laboratory. Indeed, the application goes to various 
evolutionary models and one major parameter is how selection influences the population. 
That generates many variants of Kingman’s model and a general treatment is required. 

The paper aims to establish a general model which covers the Kingman’s setting and can 
be applied to Lenski experiment. In section 2, we show briefly the Kingman’s model and 
the main observations in Lenski experiment. This section is the motivation of the paper, 
but the reader can skip it for the first reading since we will come back for applications. In 
section 3, we introduce a general setting with 3 assumptions on the fitness function. We 
give the main results for the general model when some or all assumptions hold. Section 4 
is devoted to proofs and in section 5, we show the applications to Kingman’s model and 
Lenski experiment. Section 6 summarizes the main contribution of the paper and discusses 
the comparison of our model with other works, especially with [8] on Lenski experiment. 


2 Kingman’s model and Lenski experiment 

2.1 Kingman’s model 

The model considers an effectively infinite population that reproduces asexually and has 
discrete generations. It studies a specific type and the selection influences the population 
through the fitness which is the offspring size and depends (possibly not only) on its type 
value X, a real number in the space A4 := [0,M] c Mq where M is a positive real number. 
Let P(A4) denote the set of probability measures on A4. For any u e P(A4), let niu denote 
the upper limit of the support of u, i.e., niu = sup{x : x e M. , u{[x - e , x]) > 0, VO < e < x}. So 
rriu is the largest type value an individual can take in a population with type distribution 
u. 

Assume that each individual per generation mutates independently with probability /3 
(0 < /3 < 1) and the mutant type distribution is the probability measure q on Ai, independent 
of parent’s type. Kingman [10, 11] argued that the tendency for most mutations to be 
deleterious might be reflected in a model in which the gene after mutation is independent 
of that before, the mutation having destroyed the biochemical “house of card” built up by 
evolution. 

The fitness function in this model is x x,x € A4. Let {pi)i>o denote the sequence of 
type distributions of generations i on A4 with po given as a parameter; Then {pi)i>o are 
defined recursively: 

Pi{dx) = (1 -/3) +I3q{dx),i> 1- (1) 

J xpi-i{dx) 
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In particular, we set Pi{dx) = (1 - f3)pi-i{dx) + f3q{dx), if pi-i = 5o, the Dirac measure at 

0 . 

Remark 1. Due to the expression of (1), it is clear that mp^ < max{mp(,,mq} for any i>0. 
So letting M - max{mp(,,mg} and M. - [0,max{mpi3,mq}] does not change any pi. Since 
niq < rup^, one can assume mg < rupg, otherwise we take pi as pq. For convenience, we 
introduce: 

Convention (*): 

mg < mpg,M = mpg and M. = [0, mpg\. 

If a sequence of measures (not necessarily probability measures) {hi)iyo converges in total 
variation sense to a measure h, that is, the total variation oi hi - h tending to 0, then for 
abbreviation, we say {hi)i>o converges strongly to h. 

Kingman specihcally takes M = 1 in his model. Based on the value of f Kingman 

([10]) proved that: 


Theorem 1 (Kingman). Case > /3 Then {pi )j >0 converges strongly to 

\ I3sq{dx) 

P \dx) = --——, 

s - (1 - p)x 

with s being the unique solution of f = 1. 

Case 2: f Then {pi)i>o converges weakly to 


p*{dx) 


j3q{dx) 
1 - X 


(i_ r 

J 1-y 


here 6i{dx) is the Dirac measure at 1. 


Therefore the sequence {pi)i>o converges at least in the weak sense to a limit distribution 
p* which depends only on q and /3, regardless of the specific form of pq. Biologically, it can 
be seen as a stability property of the population. 

Next we introduce the Lenski experiment and use an iteration similar to (1) to model 
the evolution of E.coli. 


2.2 Lenski experiment and modelling 

The Lenski experiment is a long-term evolution experiment with E.coli, founded by Richard 
E. Lenski in 1988 in the laboratory. The experiment is decomposed into daily cycles. Every 
day starts by sampling approximately 5 ■ 10® bacteria from those available in the medium 
that was used the previous day. This sample is then transferred to a new glucose-limited 
minimal medium and reproduce (asexually) until the medium is deployed, i.e., when there 
is no more glucose available. Around 5 • 10® cells are present at the end of each day. So 
the size grows by approximatively 100 times from the beginning of a day to the end and 
a sample of percentage around 1% will be chosen for the next day. The closely 30-year 
ongoing experiment has run more than 60,000 generations. We refer to [8] for a more 
detailed presentation and references therein. 
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There are 12 populations founded from a common ancestor. Samples, called by Lenski 
“fossil record”, are frozen every 500 generations. Once bacterium is frozen, we consider 
it stopping biological activities inside the body, which is how the name “fossil record” 
makes sense. The records are regarded as stocked information of evolutionary trajectories 
of populations. 

They define the fitness as the dimensionless ratio of the competitors’ realized reproduction 
rates. Basically, we let two populations of the same number of individuals, one of the founder 
ancestors and one of evolved strain, to be together in a medium at the beginning of a day. 
The fitness of the evolved strain is the ratio of the (exponential) reproduction rate of the 
strain observed at the end of the day and the reproduction rate of the ancestor strain. 
So in this definition, fitness is a relative quantity that measures the reproduction rate of 
the whole population. However mathematically, one can directly model the natural (non- 
relative) reproduction rate of each bacterium. To unify notations, we shall consider the 
natural reproduction rate as the type and our fitness, different from that of Lenski, is the 
offspring size in the next generation. 

Wiser et al [16] showed that the (relative) reproduction rate increases but decelerates. 
They compared the hyperbolic and sublinear power law increasing models, the former having 
a bound and the latter none. It turns out that the hyperbolic model fits to the first 10,000 
generations, but for a long term about 50,000 generations, the sublinear power law model 
is more significant. 

The unboundedness of the sublinear power law curve can be explained by the fact that 
highly beneficial mutations happen rarely but consistently and with probability 1 some of 
them fixate the population, although after a probably very long time. Therefore, on a very 
long period of time, one can consider that there is a bound (or even a pattern) for the 
reproduction rates of new mutants. 

More specifically, let po be the initial type distribution (or reproduction rate distribution) 
and q the mutant type distribution such that 0 < niq < rrip^ < oo. Let M,Ai be defined from 
PO; ^ by convention {*). Let the population grow exponentially according to the reproduction 
rate of each individual until the total amount reaches the capacity 7(7 >1) (assuming the 
initial amount 1). In Lenski experiment, 7 ~ 100. We then sample I /7 proportion of the 
population at the end of day 0 to constitute the population at the beginning of day 1 . 
However, we have new mutants arriving along the whole day. To combine the mutation 
and selection together, we assume that a /3(0 < /3 < 1 ) proportion of the sample consists of 
mutation population with type distribution q and the rest 1-/3 proportion stays unchanged. 
For any u e P(A1), let be the unique solution of f e*“’’’u(dx) = 7 . The type distribution 
Pi at generation (the beginning of day) i is determined as follows: 

e^Pi-i ^ 

Lenski model : pi{dx) = {1 - (5) - pi-i{dx) + I3q{dx), i > l,x € A4. (2) 

7 

Given type distribution pi-i, tp^_^ is the time the population needs to reach capacity 7 within 
one day. The fitness function is given hy x ^ e^Pi-i^,x € Ai. A division by 7 represents the 
uniform sampling of proportion I /7 from the population at the end of the day. 

It turns out that Kingman’s method cannot be simply applied to (2). To find a solution, 
we will establish a more general model which comprises ( 1 ) and ( 2 ) and find a general 
solution using a different approach. 
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3 General model and main results 


3.1 Modelling 


Let po be the initial type distribution and q the mutant type distribution such that 0 < 
rUg < rupg < oo. Let M,A4 be defined from pQ,q by convention {*). Let /3(0 < < 1) be the 

probability of mutation per capita per generation. In the rest of the paper, are 

fixed. 

Let w{x,u) be the (nonnegative) fitness function of an individual with type value x € A4 
and type distribution u e P(A1). Assume that given u, w{x,u) is a measurable function on 
Ai. Then the type distribution pi is defined via a recurrence relation; 


Pi{dx) = (1 -^) 


w{x,pi-i)pi-i{dx) 

f w{x,pi-i)pi-i{dx) 


+ I3q{dx),i > 1. 


In particular, if / w{x,pi-i)pi-i{dx) = 0 for some i, then 


(3) 


Pi{dx) = {I - fi)pi-i{dx) +I3q{dx). 


Remark 2. Note that the fitness w{x,pi-i) depends on pj_i only in Lenski model, not in 
Kingman’s model. 


We introduce a new notion, namely the “selective advantage” of an individual with type 
value X at generation i-l\ 


So (3) can be written as 


s{x,pi-i) 


w{x,pi-i) 

J w{x,pi.i)pi.i{dx)' 


(4) 


Pi{dx) = {I- fi)s{x,pi-i)pi-i{dx) + fiq{dx),i > 1. (5) 

One can see that x is fitter than y if and only if s{x,pi-i) > s{y,pi-i). 

We would like to achieve that {pi)i>o converges to a limit distribution given any param¬ 
eters po,q. To this end, it is not surprising that one needs some assumptions on w. The 
in-total 3 assumptions will be introduced and discussed in the next subsection where main 
results will also be announced. 


3.2 Main results 
3.2.1 Assumption 1 

We start with some notations. For any p e P(At), the distribution function Dp is defined 
as:Dp{x) = p([0,x]) for any x e M. For any e P(At), say u is stochastieally dominated 
by V, denoted by u < u, if and only if Du{x) > for any x e At. 

Let Q(At) be the set of all (non-negative) finite measures on At. Let u e Q(At) and 
a measurable set A c At, then ua is defined as ua{B) := u{B n A) for any measurable set 
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B c M. Let u,v € Q(Ai) and a measurable set A c A4. Say u is a component of u on A (or 
UA is a component of va), denoted by ua < va, if and only if va-ua^ Q(A^). 

Assumption 1: For any u,v € P(A^), if < ri[o,M); then 

s{x,u) > s{x,v), Vxe[0,M]. (6) 

Biological interpretation: Note that v is supported relatively on larger values than u. In 
Kingman’s model, fitter types are represented by larger numbers. In this spirit, a population 
with distribution v is fitter than the one with u. So given an individual with any type value 
X, it should have less selective advantage in v than in u, which justifies the assumption (6). 

For some po,Q, the limit theorem requires only the assumption 1: 

Theorem 2. Under assumption 1, ifpo - 6 m, then {pi)i>o converges strongly to a probability 
measure p* e P(AI) and p* depends only on M,q,l3. Moreover (pi)[o,M) ^ (Pi+i)[o,M) ^ 
any i>0. 


This specific p* will turn out to be the unique limit distribution in the general model. 

Proposition 1. Under assumption 1, for any po,q, ifpi{{M}) converges top*{{M}), then 
{Pi)i >0 converges strongly to p*. 

Corollary 1. Under assumption 1, for any po,q, if p*{{M}) = 0, then {pi)i>o converges 
strongly to p*. 

Theorem 3. Under assumption 1, if q{{M}) > 0, then {pi)i>o converges strongly to p*. 

The following corollary gives a practical criterion for strong convergence. 

Corollary 2. Under assumption 1, for any pQ,q, if there exists a measurable function f{x) 
on M. such that p*{dx) = f{x)q{dx). Then {pi)i>o converges strongly to p*. 

The proof is straightforward: if p*{{M}) = 0, then Corollary 1 applies. If p*{{M}) > 0 
then q{{M}) > 0. In this case Theorem 3 applies. 

3.2.2 Assumption 2 

Assumption 2: For any u,v € P(AI) and 0 < s < 1, if V[q^m) ^ and there exists 

0 < a < M such that Du{a) > Dy(a) + e, then there will be a number c{a,e) > 1 such that 

s{M,u) > s{M,v)c{a,£). (7) 

Biological interpretation: Distribution u puts more weight on [0,M) and especially £ more 
on [0,a]. So a population with type distribution u is substantially less fitter than the one 
with V. This substantiality is explained by c(a,£) > 1 at point M. 

Based on assumptions 1 and 2, one can prove limit theorem for 0 < po{{M}) < 1, q({M}) = 

0 . 

Theorem 4. Under assumptions 1, 2, f/ 0 < pq{{M}) < l,g({M}) = 0, then for any 
0 < a < M, (pi)[o,a] converges strongly to (p*)[o,a]- a consequence, {pi)i>o converges 
weakly to p*. 
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3.2.3 Assumption 3 

Now one can proceed to assumption 3 which will solve the problem for the last situation: 
Po{{M}) = q{{M}) = 0. A few notations are needed. For any h e P(A1), let /i“ = /i[o,a) + 
/i([a,for any 0 < o < ruh and /i“ = /i for a > rrih. Given q e P(A1) and 0 < a < M, 
let po = 5a be the initial type distribution and the mutation type distribution, with pi 
the type distribution of generation i. Note that niga < rup^ = a. By Theorem 2, {pi)i>o 
converges strongly to a limit probability measure, denoted by Remark that p^'* = p*. 
The assumption 3 is the following: 

Assumption 3: In the above framework, p°‘’* converges weakly to p* as a tends to M from 
left. 

Biological interpretation: This assumption says that the population is healthy and stable, 
in the sense that, by a slight type value decrease among the fittest individuals both in pq 
and/or q, the limit type distribution should not be changed too much. This assumption 
shows the stability of population resistant to type value perturbance. 

Theorem 5. Under assumptions 1,2 and 3, if pq{{M}) = q{{M}) = 0, then the same 
conclusion as in Theorem 4 follows. 

Therefore, under assumptions 1,2,3, whatever pQ,q are, {pi)i>o converges at least weakly 
to a unique limit distribution p* which depends only on q, M, (3. In particular, if pi{{M}) = 0 
for any i > 0 but p*{{M}) > 0, then we say the condensation phenomenon emerges (see [7] 
for a study of the phenomenon in Kingman’s model and related references therein to other 
models, e.g., on Bose-Einstein condensation). 

In section 5, we will prove that Kingman’s model and Lenski model satisfy the three 
assumptions and therefore limit theorems are obtained, especially Theorem 1 for Kingman’s 
model (using Corollary 2 when it comes to strong convergence). 


4 Proofs 

4.1 Proofs of Theorems 2, 3, Proposition 1 and Corollary 1 

We shall first begin with a technical lemma, which will be used frequently in the sequel. 

Lemma 1. Let only the assumption 1 he true. Let Kq and h be two initial type distributions. 
The type distribution of generation i is denoted respectively by hi, hi. The mutant type 
distribution is q in both cases. If (/io)[o,M) ^ (^o)[o,M)(®^<^ hence /io({M}) > /io({M})), 
then for any i > I, (/ii)[o,M) ^ (^j)[o,M) {and hi{{M}) > hi{{M})). 

Proof. Assume that for some generation i > 0, we have (/ij)[o,M) ^ (^j)[o,M)- By ( 6 ), we 
have 

s{x,hi) > s{x,hi),yx € M., (8) 

together with (/ii)[o,M) ^ {hi)[o,M)- 

{s{x,hi)hi{dx)) <{s{x,hi)hi{dx)) . (9) 
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Then using (5), it is clear that (^j+i)[o,M) ^ (^i+i)[o,Af)- So the induction suffices to prove 
the Lemma. 

□ 


Proof of Theorem 2. In Lemma 1, let hi = pi and hi = pi+i for i > 0. Then (/ii)i>o, {hi)i>o 
satisfy the conditions in Lemma 1. Therefore, we obtain (pi)[o,M) ^ (Pj+i)[o,M)! ^ 0. 

Hence there exists a set function p* on the measurable sets of Ai 

sup \p*{A)-pi{A)\^0. 

any measurable set 

So p* must be a probability measure on Ai and (pi)[o,M) ^ (P*+i)[o,M) ^ {P*)[o,M)- ^ 

Corollary 3. Let pQ he the Dirac measure at M. Then 

s{x,pi) increases on i and s{x,p*) > sup{s(x,pi)} for any x e Ai. (10) 

2>0 


And the probability measure p* satisfies: 

p*{dx) = {I - /3) sup{s{x,pi)}p*(dx) + Pq{dx), (11) 

2>0 

and 

p*{dx) < (1 - /3)s{x,p*)p*{dx) + /3q{dx). (12) 

In particular if for any x e Ai, w{x,-) is a continuous function on P(A1) equipped with the 
total variation norm, then (12) is in fact an equality. 

Proof. By Theorem 2 and ( 6 ), we obtain (10). Recall (3): 

Pi{dx) = (1 - I3)s{x,pi-i)pi-i{dx) + I3q{dx),i > 1. 

Due to Theorem 2, pi and pi-i converges strongly to p* and s{x,pi-i) converges increasingly 
to supj>Q{s(a;,pi)} for x e AI hxed as i tends to infinity. So in the limit, using also (10), we 
obtain (11) and (12). The last point is easy to see using some continuity arguments. 

□ 

Proof of Proposition 1 . Let po = and pi the type distribution at generation i > 1 under 
mutant type distribution q. In Lemma 1, let hi = pi and hi = pi for any z > 0. It is easy to 
see that Lemma 1 applies. So we have for any z > 0 

{Pi)[o,M) < (Pi)[o,M) and 0 < Pi{{M}) < pi{{M}). (13) 

By Theorem 2, {pi)i>o converges strongly to p*. If pi{{M}) converges to p*{{M}), using 
(13) and the fact that / Pi{dx) = f Pi{dx) = 1, {pi)i>o converges strongly to p*. 

□ 


Proof of Corollary 1. In the proof of Proposition 1, since pj({M}) tends to p*({M}) = 0, 
using (13), we also obtain pi{{M}) converges to p*{{M}) = 0. The strong convergence 
follows by Proposition 1. □ 


Proof of Theorem 3. Note that pi{{M}) > 0. For convenience we assume po({M}) > 0. 

Let {pi)i>Q be the same as in the proof of Proposition 1 and hence (13) holds. Let 

Rip{x) = {I- -^*, 2 ( 3 ;) =/3 -/3)^ n (14) 

k=0 k=0 j=0 

and 

2 — 1 2—1 k — \ 

Ti^i{x) = {I-l3yY\s{x,pk):\-x=M, Ti, 2 (a:) =/3 -/3)^ n (1^) 

fc =0 fc =0 j =0 

Then it follows from (5) that 

Pi{{M}) = i2i,i(M)po({M}) + i?i,2(M)g({M}), (16) 

and 

Pi{{M}) = Ti^i{M)po{{M}) +Ti^2{M)q{{M}) = Ti^i{M) + Ti^2{M)q{{M}). (17) 

By Theorem 2 and assumption 1, s{x,pi) < s{x,pi+i) for any x € M. and i > 0. Then 
we must have s{M,pi) < 1/(1 - /3) for any i > 0, otherwise (rj^i(M))j>o will tend to infinity 
which is impossible. Consequently, (rj^i(M))j>o decrease either to 0 or to a strictly pos¬ 
itive value. But in fact, the second option is impossible. Otherwise, as i goes to infinity, 
s{M,pi) must converge to 1/(1 -/3) which entails that lim -_ Tj^fM) = 00 and so 00 = 
lim , ^^Ti, 2 {M)q{{M}) = lim , _ i5j,(|M|) (since q{{M}) > 0). Therefore (Ti,i({M}))i>o 
converges to 0. 

By (13) and assumption 1, we have s{x,pi) > s{x,pi) for any x e A4,i > 0 which entails 
Ri, 2 {M) > Ti^ 2 {M), and i?i,i(M) < Ti^i{M)lpo{{M}) to ensure that pi{{M}) <pi{{M}). 
Therefore as i goes to infinity, i?.jq(M) converges to 0 and pi({M}) converge to p*{{M}). 
Strong convergence follows by Proposition 1. 

□ 


4.2 Proof of Theorem 4 

Proof of Theorem 4- Let {pi)i>o be the same as in the proof of Proposition 1 and then (13) 
holds. If p*{{M}) = 0, then strong convergence follows due to Corollary 1. So here we 
assume p*{{M}) > 0. Note that po{{M}) = 1, q({M}) = 0. So (16) and (17) become 

K({M}) = i2,,i(M)po({M}), pi{{M})=Ti^i{M). (18) 

By the same arguments in the proof of Theorem 3, Ti^i({M}) decreases to p*{{M}) (which 
is strictly positive by assumption) and Tj_i(M) < Ri^i{M) < Ti^i{M)j pq{{M}) for any i > 0. 

Let 0 < a < M. Due to assumption 2, for any e > 0 and any if pj([0,a]) -pi([0,a]) > e 
(the difference is always nonnegative due to (13)), then 

s{M,pi) > s{M,pi)c{a,e), with c(a,e) > 1. (19) 

However, the above display can hold only for finitely many is. Otherwise, Ri^i{{M}) will 
tend to infinity. That implies limj^oopj([0,a])-pi([0,a]) = 0. So together with (13), (pi)[o,a] 
converges strongly to (p*)[o,a]- h therefore follows the weak convergence of {pi)i>o top*. □ 


9 


4.3 Proof of Theorem 5 


We start with a lemma. 

Lemma 2. Let only the assumption 1 he true. Letpo andpQ be two initial type distributions 
such that 0 < mp^ < mp^ and (po)[o,mpp) ^ (Po)[o,mpp)- type distributions of generation 
i are denoted respectively by pi, pi. Assume that the mutation type distribution for pQ is 
q and for po is q such that q = q'^^o. Then for any i > 1, {pi)[o^mp^) < {Pi)[o,mp^) and 
= {Pi)imp^,M] ^ iPi)(mp^,M] ^(rripoM] measure on (mpo,M]. 

Proof. The proof follows the same lines in the proof of Lemma 1, using assumption 1. □ 

Given initial type distribution po with mutant type distribution q and another couple 
Po,q, dehne a function A : {po,Q,Po,q) {0)1} such that A(po,q,po,q) = 1 if and only if 
these distributions satisfy the assumptions in Lemma 2. 

Proof of Theorem 5. Let 0 < a < b < M . Let Lq = (po)^- Let hi be the type distribution 
of generation i with ho as initial distribution and q^ as the mutant distribution. Then 
m^b < mho ~ ^ > 0. Due to Theorem 3 and Theorem 4, (hj)j>o 

converges strongly to p^’* on any [0,a]. 

It is easy to verify that A{ho,q^,po,q) = 1 and A{po,q,5 M,q) = 1- Let {pi)i>o be the 
same as in (13). We could apply Lemma 2 which gives that 

{Pi)[o,b) < {Pi)[o,b) < {hi)[o,b),^i ^ 0 

and in consequence 

iPi)[0,a] ^ iPi)[0,a] ^ (d)[0,a]- (20) 

According to assumption 3, p^’* converge weakly to p* as b tends to M. Then by Port¬ 
manteau theorem ([!]), limfe^MP^’’^([0,a]) < p*([0,a]). Note that as i tends to infinity, 
{hi)[o,a] converges strongly to (/’d[o,a], so as (pi)[o,a] to (pd[o,a]- Together with (20), 
(p*)[ 0 ,a] ^ (/’d[ 0 ,a]- Therefore (/’d[o,a] converges strongly to (p’')[o,a] as b tends to M 
from left. Since h can be any number between a and M, using again (20), (pi)[o,a] converges 
strongly to (p’^)[o,a]- Then the proof is completed. □ 


5 Applications 


5.1 Kingman’s model 


We verify the assumptions 1,2 and 3 for Kingman’s model. Recall that M = 1. 

Assumption 1: It is clear that u< v (i.e., Du{x) > Dy{x), Vx > 0). Then / xu{dx) = /(l- 
Du{x))dx < J{1-Dy{x))dx = f xv{dx) which implies s{x,u) = > s{x,v) = 

for any x e A4. 


Assumption 2: We have 

y' xu{dx) = J{1 - Du{x))dx < y" {1 - Dy{x) - e±^^a,i){^))dx = xv{dx) - s{l - a) 

< J xx((ix)(l - e(l - a)) 
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1 


where the second inequality is due to 0 < / xv{dx) < 1. So 


s{M,u) 


M ^ M 1 

/ xu{dx) f xv{dx) 1 - - a) 


s{M,v) 


1 - e(l - a) 


One can take c{a,e) = to satisfy assumption 2 . 

Assumption 3: We need to prove that converges weakly to p* as a tends to 1. We 
shall derive explicit formulas for and p*. According to Corollary 3, it is easy to see that 


p‘^’*{dx) = (1-/3) 0 < a< 1. (21) 

J xp°‘' {dx) 

That is, 

Note that / xp°‘’*{dx) determines By monotonicity, the above equation has a unique 
solution p“’*. A simple calculation gives the following: 


Lemma 3. Following the above discussion, for any 0 < a < 1, 


1- If j > 1; then p‘^’*{dx) - where Sa is the unique solution of equation 

_ f 0Saq'^{dx) 

~ I Sa-(l-h)x ■ 


2 . 




I3q°-(dx) 


I3q°-{dx) 


xja ) ■ 


Remark 3. Notice that if we replace a by 1, then we obtain the same limit distributions as 
in Theorem 1. 


Now we can verify the assumption 3 by Lemma 3. 

Corollary 4 (assumption 3). As a tends to 1 from left, p°‘'* converges weakly to p*. 

Hence all three assumptions are satisfied by Kingman’s model. At least weak convergence 
of {Pi)i>o to p* is guaranteed. In particular, Corollary 2 ensures the strong convergence when 
/ > /3~^ ill Theorem 1 . That covers Kingman’s result. 


5.2 Lenski model 

We would like to apply the results from general model (3) to Lenski model. As we will show 
later (Theorem 6 ), given any po, q, {pi)i>o converges at least in weak sense to a unique limit 
distribution which depends only on q,M,/3. One can imagine that, at some day, the type 
distribution is po, and then next day a positive proportion is replaced by a fitter mutant 
population with distribution q. Assume this procedure can iterate for many days. Then 
we shall observe that as i tends to infinity that the population gets fitter and fitter but 
in a deceleration speed to achieve some limit. Note that at the early stage (around 10,000 
generations), a hyperbolic curve, which has a bound, fits to the data. Our general model also 
has a bound, in the spirit of hyperbolic curve. As time moves on, fitter mutant distribution 
pattern will appear and the previous bound will be broken up. We shall then in the long 
term see that the type values increase unboundedly and also sublinearly, since fitter mutant 
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distribution patterns are rarer. So our model provides another point of view to explain the 
increasing trend of the type values. 

We shall prove that assumptions 1 to 3 are satished by Lenski model and hnd the unique 
limit distribution. 


Assumptions 1 and 2: One just needs to follow the same procedure as for Kingman’s 
model by using distribution functions. 

Assumption 3: The same as for Kingman’s model, we give the expression of p^"'* for any 
0 < a < M. Similarly, we have 


p^’*{dx) = (1 - 13)- - ^ ^ ’ + Pq\dx) 


(23) 


which has a unique solution 

In the same spirit of Lemma 3 and Corollary 4, we have the corresponding results in 
Lenski model. 


Lemma 4. Following the above diseussion, for any 0 < a< M 


1. If f ^ ^ p°‘’*{dx) = where Sa is the unique solution of 




equation 1 = / 


g Jf C _hqf(dx)_ ^ ,r Q f pq-^ax) ^ . 


l3q°-{dx) 


fiq°‘{dx) 


Corollary 5 (assumption 3). In the Lenski model, converges weakly to p* as a tends 
to M. 


So all three assumptions are satisfied by Lenski model, which guarantees the weak con¬ 
vergence of {pi)i>o to p* . Together with Corollary 2 and replacing a by M in Lemma 4, we 
obtain the main result as follows: 

Theorem 6. 1. If f {pi)i>o converges strongly top*{dx) = 

where Sa is the unique solution of equation 1 = f ■ 

•I 2 - tL^ScL^ 

7 

VI ^ {pi)i>o converges at least weakly to p*{dx) = + 


6 Discussions 

The main generalization in our model is the dependence of the htness function on the current 
type distribution. Biologically, it means that the htness of an individual depends closely on 
his contemporaries. In other words, macroscopic epistasis enter explicitly into effect. That 
allows us to design different htness functions according to our purposes, such as in the case 
of Lenski experiment. Moreover the assumptions made on the htness function biologically 
make sense and are easy to verify, which reduces the modelling difficulties of other potential 
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biological experiments. In particular the generalization is also a simplification, because the 
tools employed in our approach such as the stochastic dominance and some convergence 
notions of (probability) measures are simple but good enough to reveal the nature of the 
asymptotic behavior of type distributions. 

Notice that we always assume M < oo. However the iteration (3) can be well defined with 
carefully chosen poj <1-, w when M = oo. Kingman [10] studied an example in the infinite case. 
However, no general dynamics exist and the only thing one can expect is that a certain 
proportion of the mass of pi moves to the right as far as possible as i increases. 

Burger [2, 3] considered fitness functions which does not depend on current type distri¬ 
bution and gathered some assumptions on the fitness function to make it possible to work 
on M < oo. It turns out that, under appropriate conditions, {pi)i>o converges strongly to 
a unique limit distribution. However this generalization does not cover neither Kingman’s 
Case 1 nor the situations with condensation phenomenon. 

The Lenski experiment has attracted much attention recently, see for instance a survey by 
Ycart et al [17] and references therein. Another paper [8] builds a very stochastic approach 
to analyze the experiment and our model can be considered as its deterministic counterpart. 
There are advantages to stay stochastic and also to stay deterministic. In [8], the authors 
considered a population with finite population N. Each time a mutation can only affect one 
individual. There arises the interesting question of studying the random evolution of mutant 
population size compared to the total population size. Another advantage is that one can 
ignore the boundedness concern of the type values if we assume each mutation brings very 
slight changes to the type value. To stay deterministic, one can quickly have a clear idea 
on how the two forces, selection and mutation, compete in the game. In particular, the 
macroscopic epistasis is designable and the deterministic general model can cover various 
other evolutionary models. In view of this comparison, a future research topic would be 
considering how to merge the stochastic and deterministic models in order to provide a 
clear global image and also allow dynamical randomnesses. 
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